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Abstract — Distributed opportunistic scheduling (DOS) is stud- 
ied for wireless ad-hoc networks in which many links contend 
for the channel using random access before data transmissions. 
Simply put, DOS involves a process of joint channel probing and 
distributed scheduling for ad-hoc (peer-to-peer) communications. 
Since, in practice, link conditions are estimated with noisy 
observations, the transmission rate has to be backed off from 
the estimated rate to avoid transmission outages. Then, a natural 
question to ask is whether it is worthwhile for the link with 
successful contention to perform further channel probing to 
mitigate estimation errors, at the cost of additional probing. Thus 
motivated, this work investigates DOS with two-level channel 
probing by optimizing the tradeoff between the throughput 
gain from more accurate rate estimation and the resulting 
additional delay. Capitalizing on optimal stopping theory with 
incomplete information, we show that the optimal scheduling 
policy is threshold-based and is characterized by either one or 
two thresholds, depending on network settings. Necessary and 
sufficient conditions for both cases are rigorously established. 
In particular, our analysis reveals that performing second-level 
channel probing is optimal when the first-level estimated channel 
condition falls in between the two thresholds. Numerical results 
are provided to illustrate the effectiveness of the proposed DOS 
with two-level channel probing. We also extend our study to the 
case with limited feedback, where the feedback from the receiver 
to its transmitter takes the form of (0, 1, e). 



I. Introduction 

Channel-aware scheduling has recently emerged as a 
promising technique to harness the rich diversities inherent 
in wireless networks. In channel-aware scheduling, a joint 
physical layer (PHY)/medium access control (MAC) optimiza- 
tion is utilized to improve network throughput by scheduling 
links with good channel conditions for data transmissions [1], 
[10], [14]. While most existing studies in the literature focus 
on centralized scheduling (see, e.g., [4], [7], [9], [10], [14]), 
some initial steps have been taken by the authors to develop 
distributed opportunistic scheduling (DOS) to reap multiuser 
diversity and time diversity in wireless ad-hoc networks [17]. 

The DOS framework considers an ad-hoc network in which 
many links contend for the same channel using random 
access, e.g., carrier-sense multiple-access (CSMA). However, 
random access protocols provide no guarantee that a successful 
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channel contention is necessarily attained by a link with good 
channel conditions. From a holistic perspective, a successful 
link with poor channel conditions should forgo its data trans- 
mission and let all links re-contend for the channel. This is 
because after further channel probing, it is more likely for 
a link with better channel conditions to take the channel, 
yielding possibly higher throughput. In this way, multiuser 
diversity across links and time diversity across time can be 
exploited in a joint manner. However, each channel probing 
incurs a cost of contention time. The desired tradeoff between 
the throughput gain from better channel conditions and the cost 
for further probing reduces to judiciously choosing an optimal 
stopping rule for channel probing and the transmission rate 
for throughput maximization. Using optimal stopping theory 
(OST), it is shown in [17] that the optimal scheduling scheme 
turns out to be a pure threshold policy: The successful link pro- 
ceeds to transmit data only if its supportable rate is higher than 
the pre-designed threshold; otherwise, it skips the transmission 
opportunity and lets all other links re-contend. In general, 
threshold-based scheduling uses local information only and 
hence it is amenable to easy distributed implementation in 
practical systems. 

The initial study of DOS [17] hinges upon a key assumption 
that the channel state information (CSI) is perfectly available 
at the receiver. In practice, the link rates are estimated with 
noisy observations. It is shown in [13] that the signal-to- 
noise ratio (SNR) estimated by the minimum mean squared 
error (MMSE) method tends to be larger than the "actual 
SNR". Thus, the transmission rate must be backed off from 
the estimated rate in order to avoid transmission outages. Our 
initial steps in [18] show that the optimal scheduling policy 
under noisy channel estimation still has a threshold structure. 

Despite their robust performance under noisy channel es- 
timation, the linear backoff schemes proposed in [18] back 
off the rate by a factor that is proportional to the channel 
estimation errors, which may lead to severe throughput degra- 
dation, especially in the low SNR regime, due to a more 
conservative rate backoff. Recently, wideband communications 
(e.g., ultra-wideband), has attracted significant attention [15], 
owing to its low-power operation and the ability to co-exist 
with other legacy networks, etc. The great potential of wide- 
band communications gives an impetus to address the problem 
of throughput degradation, due to estimation errors, in the 
low-SNR (wideband) regime. More specifically, to circumvent 
this drawback, a plausible solution is to mitigate the rate 
estimation errors by performing further channel probing. In 
the sequel, we call the initial rate estimation performed during 
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the channel contention as "first-level probing", whereas the 
subsequent probing performed after the successful contention 
is referred to as "second-level probing". Clearly, the improved 
rate estimation obtained with second-level probing enables 
the desired link to make more accurate decisions. However, 
the advantages of second-level probing come at the price of 
additional delay. This gives rise to two important questions: 
1) Is it worthwhile for the link with successful contention to 
perform further channel probing to refine the rate estimate, at 
the cost of additional probing? 2) While there is always a gain 
in the transmission rate due to the refinement, how much can 
one bargain with the additional probing overhead? 

Specifically, we consider distributed opportunistic schedul- 
ing with two-level channel probing. Based on the recent 
advances in OST, namely OST with two-level incomplete 
information [12] and statistical versions of "prophet inequal- 
ities" [2], we provide a rigorous characterization of the 
scheduling strategy that optimizes the tradeoff between the 
throughput gain achieved by second-level channel probing and 
the resulting additional delay. It is shown that the optimal 
scheduling strategy is threshold-based and is characterized 
by either one or two thresholds, depending on the system 
parameters. By establishing the corresponding necessary and 
sufficient conditions for these two cases, we show that the 
second-level probing can significantly improve the system 
throughput when the estimated rate via first-level probing 
falls in between the two thresholds. In such scenarios, the 
cost of addition delay can be well justified by the throughput 
enhancement using the second-level channel probing. 

Our intuition is as follows: When the link rate is small, 
it makes sense to give up the transmission, since the gain 
due to rate refinement would be marginal due to the poor 
link condition. On the other hand, when the rate is large 
enough, it may not be advantageous to perform additional 
probing as the refinement is meager. Then, it is natural to 
expect that there exists a "gray area" between these two 
extremes where significant gains are possible by refining the 
rate estimate with additional probing. We elaborate further 
on this in Section [ill] Finally, through numerical results, we 
illustrate the effectiveness of the proposed scheduling scheme. 

Before proceeding further, the main contributions distin- 
guishing this work from other existing works should be 
emphasized. OST under two levels of incomplete information 
is addressed with the objective of maximizing the net-return 
in [12]; in contrast, we study OST with two levels of probing 
as applied to DOS with the objective of maximizing the rate 
of return (i.e., the throughput). Moreover, the optimal strategy 
proposed in [12] presents itself in two mutually exclusive 
types for decision-making, whereas, our study here reveals 
that, depending on the parameters, there are more options 
for DOS, and the corresponding optimal strategy is more 
sophisticated. Furthermore, we study distributed opportunistic 
scheduling for ad-hoc communications under noisy conditions 
where the rate estimate is available only after a successful 
channel contention; and this is clearly different from [13] 
which considers centralized scheduling assuming that the rate 
estimates of all links are available a priori at the base station. 
Despite the fact that both this work and [18] study dis- 



tributed opportunistic scheduling with imperfect information, 
this work concentrates on proactively improving throughput by 
enhancing rate estimation, whereas [18] proposes to passively 
reduce data rate to avoid transmission outages. Another related 
work [16] uses optimal stopping theory to investigate the 
intrinsic trade-off between energy and delay in distributed data 
aggregation and forwarding in sensor networks. 

The rest of the paper is organized as follows. In Sec- 
tion III-AI we present the system model, and in Section IH-BI 
we provide background on DOS with only first-level probing 
in noisy environments, in Section Hill we present second-level 
channel probing and characterize the optimal DOS with two- 
level probing. We also present numerical results to illustrate 
the gain due to two-level probing. In Section |IV] we extend 
our study to the case where there is limited feedback from 
the receiver to its transmitter. Finally, Section [V] contains our 
conclusions. 

Notation : | • | denotes the amplitude of the enclosed complex- 
valued quantity. We use E[-] for expectation. 

II. System Model 
A. System model and overview 




Fig. 1. Illustration of the ad-hoc network under consideration. 

Consider a single-hop ad-hoc network in which L links 
contend for the channel using random access, as illustrated in 
Fig.[U A collision model is assumed for random access, where 
a channel contention of a link is said to be successful if no 
other links transmit at the same time. Let pi be the probability 
that link I contends for the channel, I — X,...,L, Then 
the overall successful contention probability, p s , is given by 
Ps = E^=i [piYlr^ii 1 ~ pS) ( cf - [3])- For ease of exposition, 
we assume that the contention probabilities, {pi\, remain 
fixed (our studies with adaptive contention probability are 
underway [6]). We define the random duration of achieving 
one successful channel contention as one round of channel 
probing. Clearly, the number of slots in each probing round, 
K, is a geometric random variable, i.e., K ~ G(p s ). Denoting 
the slot duration by r, the corresponding random duration 
for one probing round thus becomes Kt, with its expected 
value being r/p s . We assume that a link can estimate its 
link conditions (hence, the transmission rate) after a successful 
contention^ 

'The successful link can carry out its rate estimation via a training phase 
during the request-to-send/clear-to-send (RTS/CTS) handshake, which follows 
a successful contention. This procedure is fairly standard in the literature, and 
is not dealt here. 
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Let s(n) denote the successful link in the n-fh round of 
channel probing. Due to the nature of wireless channels, 
the rate in each probing round is random. Following the 
standard assumption on block fading channels in wireless 
communications [11], we assume that the channel remains 
constant for a duration, T (i.e., T is more or less the channel 
coherence time). When an estimate of the transmission rate 
is available, the successful link may decide to transmit over 
a duration, T, if the rate is high enough, or may skip i0 and 
allow all links to re-contend, in the hope that another link with 
a better channel will take the channel later. 

To get a more concrete sense of joint channel probing and 
distributed scheduling, we depict, in Fig. [2] an example with 
N rounds of channel probing and one single data transmission. 
Specifically, suppose after the first round of channel probing 
with a duration of K\ slots, the rate of link s(l) is very small 
(indicating a poor channel condition); and as a result, s(l) 
gives up this transmission opportunity and lets all links re- 
contend. Then, after the second successful contention with a 
duration of K 2 slots, link s(2) also gives up the transmission 
because its rate is also small. This continues for TV rounds 
until link s(N) transmits because its transmission rate is good. 
Clearly, there exists a tradeoff between the throughput gain 
from better channel conditions and the cost for further probing. 
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Fig. 2. A sample realization of channel probing and data transmission. 

In [17], we show that the process of joint channel probing 
and distributed scheduling can be treated as a team game in 
which all links collaborate to maximize rate of return (the 
average throughput). Specifically, as illustrated in Fig. |2j after 
one round of channel probing, a stopping rule TV decides 
whether the successful link carries out data transmission, or 
simply skips this opportunity to let all links re-contend. Let 
R n be the transmission rate of the successful link in the n-th 
round of channel probing, and T n = Xw=i Kj T + T be the 
total system time, defined as the sum of the contention time 
and the transmission duration. It turns out that the optimal 
DOS strategy achieving the maximum throughput hinges on 
the optimal stopping rule TV*, which yields the maximal rate 
of return 9*. That is, 

E[R N T] 



and 



where 



N* 



= SUP PIT 1 ' 
NeQ E[1 N \ 

E[R N T] 
arg max - — . 

N&Q E[T N \ 



Q = {N :N> 1,E[T N ] < oo}. 



(1) 



(2) 



(3) 



It is clear that R n plays a critical role in distributed 
opportunistic scheduling. In practice, rate estimates are seldom 
perfect. It is shown in [13] that the rate corresponding to the 
estimated SNR tends to be greater than the actual rate, and 
subsequently the transmission rate must be backed-off from 
the estimated rate to avoid outages. Then, a natural question 
to ask is whether it is worthwhile for the link with successful 
contention to perform further channel probing to refine the 
channel estimate, at the cost of additional probing overhead, 
and how much can one bargain? 

Intuitively speaking, when the transmission rate is small, it 
makes sense to give up the transmission, since the gain due 
to rate refinement would be marginal due to the poor link 
conditions. On the other hand, when the rate is large enough, 
it may not be advantageous to perform additional probing as 
the improvement is meager. It is natural to expect that there 
exists a "gray area" between these extremes where significant 
gains are possible by refining the rate estimate with additional 
probing. In what follows, we seek a clear understanding of the 
above fundamental issues. 

To this end, we present the PHY-layer signal model first. 
The received signal corresponding to s(n) can be written a^| 



(4) 



where p is the normalized receiver SNR, h s t n ){n) is the chan- 
nel gain for link s(n), X s ^(n) is the transmitted signal with 



-E[|Xj {n )(n)| ] = 1, and £, s (n)( n ) is additive white Gaussian 
noise (AWGN) with unit variance. In this work, we consider 
a homogeneous network in which all links are subject to 
independent Rayleigh fading, with identical channel statistics. 
Moreover, ft. s („) (n) and h s r m ) ( m ) are independent for n ^ rn, 
and remain constant over the channel coherence time. This is 
a practically valid assumption because the likelihood of one 
link (say link m) achieving two consecutive successful channel 
probing, PmJ2i=£ m (^ ~ Pi) 2 ' is fairly small, especially when 
the number of links in the network is large. Furthermore, even 
if the same link successfully obtains two consecutive channel 
contentions, the channel conditions corresponding to the two 
consecutive successful channel probings are independent since 
the channel probing duration in between is designed to be 
comparable to the channel coherence time. 

Without loss of generality, we focus on the n-th probing 
round and omit the temporal index n, whenever possible, for 
notational simplicity. For presentational simplicity, we use Y n , 
X n and f„ and h n to denote Y s{n) {n),X s{n) {n), £ s(n )(rc) and 
h s ( n ) (n), respectively, in the sequel. 

When perfect CSI is available to the source node as assumed 
in [17], the instantaneous supportable data rate is given by the 
Shannon channel capacity: 



R n = W\og(l + p\h n \ 2 ), 



(5) 



where W is the bandwidth, and {Rn, n = 1, . . . , } are inde- 
pendent due to the independence assumption on h n . 

To facilitate our analysis, we concentrate our following 
investigation in the low SNR (wideband) regime, assuming 



2 This decision can be broadcast to all users in the one-hop neighborhood 
(e.g., NCTS). 



3 We note that the results reported here can be extended to frequency- 
selective fading channels by replacing scalar fading parameters with vectors. 
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p — > and W = O(-). It is well known that a decrease in 
SNR estimation error can only increase the rate of commu- 
nication. For cases with wideband signaling (e.g. in the low 
SNR regime), where an increase in the SNR results in a linear 
increase in the throughput, obtaining more accurate estimates 
of the SNR can yield substantial benefits. 



B. DOS with one-level probing 

In this section, we briefly examine DOS with one-level 
channel probing in the low SNR regime (cf. [18]). Let M be 
the training length. Thus r t = MT S , where T s is the symbol 
duration. We assume that M is large enough to ensure that 
r t — 6(1). Assume that the rate estimation is performed based 
on the MMSE principle. For simplicity, we assume that the 
probing symbols X m = 1, without considering pilot design. 
Then, it is straightforward to show that, hit , the minimum 
mean square error (MMSE) estimate of h is given by [8] 



pM 



(6) 



m— 1 



Substituting into we can express h in terms of 



as 



h = W+W, 



(7) 



where hn is the corresponding estimation error. Note that the 
orthogonality principle holds, and we can verify that hjt' and 
hn^ are uncorrected, where 



~ CM I 



and 



hW~CM(0, 



pM 
pM + 1 

1 

pM + 1 



(8) 



(9) 



Without perfect CSI, the source node employs the estimated 



SNR {p\h 



(1)|2 



.} as the basis for distributed schedul- 



ing, despite the fact that the actual SNR is given by 

p|^ 1} | 2 



a! 1 



i 



(10) 



MY 

where we have modeled the channel estimation error as 
additive Gaussian noise [13]. 

Inspection of (TTOt reveals that is always smaller than 
the estimated SNR {p\hn ^ | 2 }, in the presence of channel 
estimation errors. As a result, an outage occurs if the source 
node transmits at a data rate specified by {plft^ 1 ' | 2 }. To 
circumvent this problem, a linear backoff scheme is used to 
reduce the data rate [18]. More specifically, the estimated SNR 
is linearly backed off to aMp\hn | 2 , where <7j\/ is the backoff 
factor with < er M < 1. Under imperfect information, the 
transmission rate in the low-SNR wideband region simplifies 
to 



R^^pWa M \h^ 



(11) 



For more details of arriving at the above equation, we refer 
to Appendix lAl It can be shown that the optimal DOS policy 
with noisy channel estimation remains threshold-based with 
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Fig. 3. A sketch of DOS with two-level probing. 

the optimal threshold 9 given as the solution to the following 
optimality equation (cf. [18]): 

6t 



E 



PsT' 



(12) 



III. DOS with Two-Level Probing 

In this section, we characterize the optimal DOS with two- 
level probing, i.e., the links may choose to refine their rate 
estimates before making a decision on whether to transmit or 
not. We illustrate, in Fig. [3] the underlying rationale behind 
DOS with two-level probing. In the following, we detail the 
procedure with second-level probing, and then cast DOS with 
two-level probing as a problem of maximal rate of return 
using optimal stopping theory with incomplete information. 
We then characterize the corresponding structure and provide 
a complete description of the optimal strategy. 

A. Second-level probing 

To improve the estimation accuracy, the receiver of the 
successful link can request its transmitter to send another 
pilot packet, at the cost of a part of the data transmission 
duration allotted to it. More specifically, in addition to the pilot 
symbols sent during the first-level probing, the receiver obtains 
a refined estimate of h n by exploiting the newly transmitted 
pilot symbols of length r t during second-level probing (of 
duration r). Then, the link uses the remaining T— t for the data 

f (2) 

transmission. We let h n denote this refined estimate of h n , 
obtained via two-level probing. Following the same procedure 
described in the previous section, we can show that 

_ 2M 

f (2) VP 

2pM + 



T 5> 



(13) 



5 



~ (2) 

We can show that the estimate hh, , and the corresponding 
estimation error are uncorrelated, where 

p2M 



p2M+l 



and 



~ CM (o, 

~ CM (o, — \ ) 

V p2M + 1J 



Finally, the resulting data rate is computed as 

RW=pWa 2M \hWf, 



(14) 



(15) 



(16) 



where a%u is the corresponding linear rate back-off factor. 

Next, we establish the relationship between the estimates 
due to first-level and second-level probings. To this end, we 
apply the principle of linear estimation to represent h n as the 

as 



linear combination of orthogonal components h e and h 

fc<? = HP + h e , (17) 



where h e — CM(0, a\), with a\ 
By orthogonality, we have 



(Mp+l)(2Mp+l) ' 



E[\h% 



2)121 



E\ 



(l)|2l 



(18) 



For convinience, we introduce a new random variable z = 
\ / &2AiWph e , where z ~ CM(0, R e ), with R e — a 2 MWpa^. 
Thus, it follows that the expected rate of the second-level 
probing conditioned on the rate due to first-level probing, 
obeys the following relationship: 

E[R^\R^]=CrR^+R e . 

We note that R e can be interpreted as the expected relative 
rate gain due to the second level probing. 



B. Scheduling options and rewards 

In what follows, we devise DOS with two levels of probing 
using optimal stopping theory. Drawing on the ideas from [5], 
we show that optimizing the network throughput via DOS can 
be cast as a maximal rate of return problem. 

Consider the network model in Fig. Q] It takes a total 
duration of Y%=i to reach the n-th round of probing. 
After the n-th round of probing, the successful link has the 
following three options after computing its rate Rn- 

1) Transmit at rate R n ^ for time T; 

2) Defer transmission and let all nodes re-contend; 



3) Perform second-level probing to obtain the new rate 

(2) (2) 

Rii , and then decide whether to transmit at R n for 
time T — r or to defer and re-contend. 

Clearly, the basis for distributed opportunistic schedul- 
ing with two-level probing is the observation sequence 
^Rn\ -Rl 2) | , with the option of skipping RnK We introduce 
9 as the Lagrange multiplier, which can be interpreted as 
the cost for using one unit of the system time (that includes 
the contention, probing and transmission times). Then, the 
first successful channel probing incurs an average cost of 
Or/ps, the second-level probing incurs 9t, whereas the data 
transmission for a duration of T<j entails a cost OTj. 



Let <j>„ : TZ + -> {0, 1,2} and : TL + -> {0,1} be the 
decision sequences after K^' = x is observed. In particular, 
4>n{x) = 1 refers to transmitting at the current rate, <fi n (x) 
means giving up the transmission and re-contend, while 
4>n{x) = 2 indicates engaging in the second-level probing. 
Furthermore, when (j) n (x) — 2, the final decision hinges on 

(2) 

Rn = y: if ip n (y) = 1, the link transmits at the refined rate, 
whereas if ipniy) = 0, the link gives up the transmission and 
lets all nodes re-contend. 

Next, let N denote the stopping rule 

N = inf{n > l\<p n = 1, or <\> n = 2 and ip n = 1}. 
Then, the expected net reward (expected return) is given by 

r = E [R N T diN - 9T N ] , (19) 

where R n is the transmission rate after the n-th probing round 
and is given by 

I{<t> n = 1) ■ R n 1] + I{K = 2)I(i> n = 1) • R%\ 



where /(•) is the indicator function, and T n is the total time 
given by 



3=1 



which defined as the sum of total contention time (and time 
due to second-level probing, when performed) and the data 
transmission duration, Td,„ = T — I(<fr n = 2)I(^>„ = 1)t. 
The corresponding rate of return is E[RnTci,n]/ E\Tn\ The 
maximal expected return is given by 



r = sup E [R N T d ,N 



6T, 



N 



(20) 



Note that the expected return, r, depends on the decision 
functions <f>, ip, and the cost 9. The principal objective is to 
maximize the rate of return (i.e., the throughput) of the DOS 
with two-level probing, defined as 

E [RNTd^w] 



sup 



E [Tj 



Summarizing, we are interested in seeking a stopping rule 
N E Q that obtains 9*. The following lemma relates the 
optimal throughput 9* to the expected optimal return r$, and 
guarantees the existence of such an optimal stopping rule. 

Lemma 1: For DOS with two-level probing, the optimal 
stopping rule N* exists. Furthermore, 9* is attained at N*, 
and 9* satisfies 

r a = sup E [R N T d ,N - 0*T N ] = 0, 
NeQ 

Proof: See Appendix iBl 

Next, we derive the optimality equation for DOS with 
two-level probing. Without loss of generality, we assume the 
transmission duration to be unity, i.e. T = 1. 

We begin by considering the option of second-level probing 
and introducing its associated reward function. Suppose after 
observing R^ — x, the link performs a second-level probing 

(2) 

to obtain R n ' , and then uses an optimal strategy thereafter. 

(2) 

Then, depending on R n — y, it may choose to transmit at rate 
y, for a duration of 1 — t, if the associated reward is greater 
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than r (the expected net reward); otherwise it would defer and 
re-contend. The reward associated with the data transmission 
is (1 — t)u — (1 — t)6 in this case. In a nutshell, the expected 
net reward corresponding to the second-level probing is then 
given by 



Je(x,r) 



rG(- 
+ (1 



+ e\x) 



(y-6)G(dy\x) 



(21) 



where G(y\x) is the conditional cumulative distribution func- 
tion (cdf) of Rn \ given R^' = x. Note that G{y\x) is non- 
central x 2 with two degrees of freedom. Furthermore, both 
R { n ] and ilL 2) are exponentially distributed. We use F and F\ 
respectively, to denote the cdfs of R^P and r!£ \ Finally, it 
can be shown that lim G(y\x) = and E [y\x] = c r x + R e . 

x — >oo 

In a nutshell, upon observing R„ — x after the rt-th 
probing round, the link s(n) can obtain one of the following 
three rewards: 

1) x — 9: the reward by transmitting at a rate x; 

2) r : the reward obtained by forgoing the current oppor- 
tunity and re-contending; 

3) Je(x, ro): the reward by resorting to refining the rate via 
second-level probing. 

Note that, in computing the rewards above, we have omitted 
the cost for obtaining the first successful channel probing, i.e. 
6t/p s , since it is common to all three returns. The optimal 
strategy for the link is to choose the option that yields the 
maximum of the above rewards. It follows that the optimal 
DOS strategy with two-level probing satisfies the following 
optimality equation [12]: 



E 



-8,r ,Je(R {1 \r )} 



Ps 



(22) 



where R^ has same distribution as Rn . 

From Lemma Q] when the throughput, as a function of 9, 
reaches its maximum, we have that r = at 9 = 6* [5]. 
Thus, (l22l can be rewritten as 



E 



:{R^-9*,J e ,(R^,0)} 



Ps 



(23) 



Inspection of d23l indicates that the second-level probing is 
optimal when Jg* (x, 0) > and Jg* (x, 0) > x — 9* for some 
x. 

It is worth noting that the following fact holds: 

A E[RW] 



> Or. = 



+ 1 



(24) 



Note that 9l corresponds to the throughput of PHY-Oblivious 
scheduling, which is a single-level probing scheme with zero 
threshold. This can be achieved by the degenerate stopping 
rule, which stops at the very first time. 

C. Structure of optimal scheduling strategy 

We next proceed to study the structure of the optimal 
scheduling strategy. Essentially, the optimal strategy takes a 



threshold form. Depending on the specific network setting, 
the optimal strategy may admit one of the two intuitively 
reasonable types, namely Strategy A and Strategy B. Generally 
speaking, under Strategy A, it is always optimal to demand 
additional information when the estimated rate lies between 
two thresholds. This is the case when the gain due to second- 
level probing is comparable with the additional overhead. In 
contrast, under Strategy B, there is never a need to appeal 
to second-level probing. This case occurs for example, when 
the improvement due to the refinement is dominated by the 
probing overhead. An extreme example of this case is when 
perfect CSI is available to the transmitter. 

Before we state the main result on the optimal strategy, we 
define q(x) = Jg*(x,0) — x + 9*. Intuitively speaking, q{x) 
represents the expected gain achieved by second-level probing 
compared to directly transmitting at the current rate. Thus, if 
q{x) > 0, performing second-level probing is a better option 
than directly proceeding to data transmission. We need the 
following lemmas before characterizing the structure of the 
optimal scheduling strategy. 

Lemma 2: Jg* (x, 0) and q(x) are characterized by the 
following properties: 

i) Jg*(x,0) is monotonically increasing in x with 
lim Jg*(x,0) = 00, and lim Jg*(x,0) < when 

x^oc x^Q 

Re 



< 



1-T - 



ii) For c r < q{x) is monotonically decreasing in x 

with lim q(x) > and lim q(x) — —00. 

x— >0 x— >oc 

Proof: See Appendix ICl 

Remarks: Observe that the above conditions are stated in 
terms of the design variables (e.g., t and c r ). It is clear 
that R e < 9*, since R e is the relative gain due to rate 
refinement and cannot be greater than the optimal throughput 
9*. It follows that e _7 ^ < e -1 . For example, to satisfy the 
condition in i), it would suffice to have r > 1/(1 + exp(l)). 
This range of r is applicable to many scenarios (e.g., IEEE 
802.11). Moreover, it is easy to satisfy the condition in ii) by 
choosing c r < 1/(1 — r) — S, where S > 0. 

Lemma 3: There exists at most one solution, in terms of 
{xj, x q , 9*}, to the following system of equations: 

^{1-G{u\xj))du=^, 

(c r (l - r) - l)x q + (1 - r) (i? e + Jo* G(u\x q )duj = 0, 

Jg, («, 0) dF(u) + - 9*) dF(u) = e -^/ 

(25) 

Recall that xj and x q are the solutions to Jg* (x, 0) = and 
q{x) = 0, respectively. From Lemma |2] it is easy to see that 
there is at most one pair {xj,x q } satisfying (l25l l. Similarly, 
since Jg*(x,0) and q(x) intercept at x — 9*, there exists at 
most one 9* due to the monotonic nature of Jg, (x, 0) and 
q(x). 

For convenience, let {xj, x q , 9*^} denote the solution to (|25| > 
with xj < x q , and 9* B be the solution to (fT2l . Using the above 
lemmas, we obtain the following result on the structure of 
optimal scheduling strategy. 

Theorem 1: The optimal strategy for DOS with two-level 
probing, takes one of the two forms: 
[Strategy A] It is optimal for the successful link 
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Reject and Acquire second-level Proceed to 

re-contend channel information data transmission 



Fig. 4. A structural sketch for Strategy A. 

i) to transmit immediately after the first-level probing if 
Rn' 1 > x q ; or 

ii) to give up the transmission and let all links re-contend 
if Rn^ < xj; or 

iii) to engage in second-level probing if RfP € [xj,x„]; 

(2) (2) 

upon computing the new rate R n , transmit at rate R n 

(2) 

if R n > 9% or to give up the transmission otherwise. 

Furthermore, the throughput under Strategy A is 9* A . 
[Strategy B] There is never a need to perform second-level 
probing. That is, it is optimal for the successful link to 
transmit at the current rate R^t if Rn > 9* B , or to defer 
its transmission and re-contend otherwise. Furthermore, the 
throughput under Strategy B is 6* B . 
Proof: See Appendix ID1 

D. Optimality Conditions 

In previous sections, we have studied DOS with two-level 
probing within the OST framework, and characterized the 
structure of optimal scheduling strategies. Our findings reveal 
that optimal scheduling may take either of two forms: Strategy 
A or Strategy B. The next key step is to determine the 
conditions under which it is optimal to use Strategy A or 
Strategy B. We show that this can be easily determined by 
performing a threshold test on the function Jg*(-, •). We have 
the following theorem. 

Theorem 2: Strategy A is optimal if Jg* (0^,0) > 0; else, 
Strategy B is optimal. 

Proof: See Appendix [E] 

E. Numerical Results 

In this section, we provide a numerical example to illustrate 
the effectiveness of the proposed DOS with two-level probing 
under noisy estimation. Specifically, we compare the perfor- 
mance of the proposed DOS with two-level probing, with that 
of DOS with one-level probing and PHY-oblivious scheduling. 
The baseline for comparison is the PHY-oblivious scheduling 
that does not make use of any link-state information. We focus 
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Fig. 5. A structural sketch for Strategy B. 



on the relative gain over PHY-oblivious scheduling, which is 
a function of pM, and is defined as 



T(pM) = 




We set p s = exp(-l), M = 300 and W = 3000, so 
that Tt = 0.1 and r = 0.2. Fig. [6] depicts the performance 
comparison. It is clear that the relative gain achieved by DOS 
with two-level probing substantially outperforms that obtained 
by DOS with one-level probing. Observe that the performance 
gain is significant in the low SNR regime (i.e., smaller values 
of a). As a increases, the relative gain of DOS with two- 
level probing approaches that of DOS with one-level probing, 
and our intuition is that, for higher values of a, the cost of 
overhead offsets that of the rate gain due to additional probing. 
Accordingly, the "gray area" between two thresholds (xh and 
x q ) collapses, and the optimal strategy degenerates to Strategy 
B, which is essentially DOS with first-level probing. 




0.2 0.4 0.6 0.8 1 1.2 1.4 



ct - p M 

Fig. 6. Relative gain f as a function of a = pM. 
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IV. DOS with Two-Level Probing: The Limited 
Feedback Case 

In the above studies, it is assumed that for the link with 
successful contention, its transmitter has the knowledge of the 
rate estimate for data transmissions. In some practical sce- 
narios, there is only limited feedback from the receiver to the 
transmitter. With this motivation, we extend the study on DOS 
with two-level probing, to the case where the feedback from 
the receiver to its transmitter takes the form (0, l,e). More 
specifically, the decisions from the receiver to the transmitter 
are conveyed by using "NACK/ACK/ERASURE" signaling, 
where "NACK" is represented by "0" corresponding to the 
decision of defer and re-contend, "ACK" by "1" corresponding 
to the decision of transmit, and "ERASURE" by "e" indicating 
that the rate estimate falls in the gray area. 

A. One-level probing 

We first consider DOS with one-level probing, with one- 
bit feedback from the receiver to its transmitter. The basic 
idea is as follows. A constant transmission rate, denoted as 
Ri, is pre-determined and known to the transmitter, and the 
data transmission takes place only when the one-bit feedback 
is "1". A central problem here is to design the transmission 
strategy for maximal throughput. Let 7 be the price function 
per unit time. Then, given its current rate estimate Rn , the 
successful link in the the n— th probing has two options: 

> "1" — transmit at rate R\, and the corresponding reward 

is Ril^ > Ri) - 7; 
• "0" — defer and re-contend, with the expected reward of 

ro- 

Clearly, there is an average cost of jp s for every successful 
contention. 

Let 7 = sup E[R N T]/E[T N ] be the optimal throughput. 

JveQ 

Then, based on Theorem Q] the optimality equation is given 
by 



E 



> i?i) 



7 



7T 

Ps ' 



(26) 



As a result, we can show that the optimal policy in this case 
still has a threshold structure with i?i being the threshold. 
Furthermore, noting that Rn ~ exp(E[R^ 1 ' > ]), we conclude 
that the average throughput is given as 



Rie 



Hi 



JL -L g E[RW] 
Ps 

Observe that 7 is a function of i?i . For a given stopping rule, 
Ri can be chosen to maximize the throughput, i.e., the optimal 
transmission rate R\ and the corresponding throughput obey 
that 

Ri = argmax7; and j max = j(Ri). 
Ri 

It can be shown that R\ is the solution to 



Ri 



;E[R(1)] = '_L_ 

T 



(27) 



It follows that the optimal throughput is given by 



In 



i?x - E[RW] = tlElR^hem^] . (28) 

r 



B. Two-level probing 

Next, we study DOS with two-level probing, with the 
feedback taking the form of (0, 1, e). Along the same line as 
in the studies in Section [HI] the receiver of the successful link, 
depending on its rate estimate i?„ , presents three options to 
its transmitter: 

• "1" — transmit at the rate R%; 

• "0" — defer and re-contend; 

(2) 

• e" — perform a second-level probing to obtain R n ', and 
then decide: 

- "1" — to transmit at rate R\, 

- "0" — defer and re-contend. 

Define 7* = sup E[RNT d<N ]/ E[Tn], which represents the op- 

timal throughput for the given R\. By Theorem [[] this 
corresponds to ro = 0. Since, 7* is the function of the rate 
Ri, we further maximize the throughput over all choices of 
Ri, by defining -/* max = max 7*. 

Hi 

We can write the expected net reward function correspond- 
ing to the second-level probing as 

i>oo 

V r (x, Ri) = (l- t)(R% - 7*) / G(dy\x) - 7 *t, 

JRi 

which can be further simplified as 

V T {x,Rx) = (1-t)(R 1 - 1 *){1-G(R 1 \x))- 1 *t. 
The optimality equation in this case is given by 

E [max|i?iJ > -7*, V 7 .(i? (1) ,7?i)|] + = 7* — . 

The following lemma gives useful bounds on the optimal 
throughput. 

Lemma 4: For a given transmission rate R\, the optimal 
throughput obeys that 



where 

1L 



1L < 7 < 1U, 



(l-r)i?! 



Ju 



(i_ T ) +T h + ij e S[B27] 1 + l 



Remarks: 

a) The lower bound 7^ is obtained by using a strategy where 
the successful link always performs a second level probing, 
and then decides to transmit for a duration of 1 — r or to re- 
contend. 

b) The upper bound ju is achieved by a genie aided scheme, 
where the successful link contends only when its channel is 
good and there is no transmission outage. 

Next we turn our attention to structural properties of the 
optimal strategy. For convenience, define the relative gain 
function as 

q r (x, R x ) = Vy {x, Ri) - Ril(x > + 7*. 
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Lemma 5: V~ l *{x,R\) and q 1 *{x^R\) are characterized 
by the following properties: 

i) V~*(x, Rx) is monotonically increasing in x. Further- 
more, lim Vy*(x, R\) — c\ > 0, if r < 1 — p s , and 

X — >oc 

lim Vy,(x,Ri) < 0, when r > 0.5(ln(l + — ) - 1). 

x^O P s 

ii) q~f*{x,Ri) > for x < Rx; and q 1 *{x,Rx) < for 
x > Rx. 

Proof: See Appendix [F| 
The above lemma serves as the basis to determine the 
optimal DOS scheduling under the feedback of (0, l,e). 
Specifically, from the properties of Vy*(-,Rx), there exists 
some x„ such that 



i) feed back "1 " if R^P > Rx, indicating to transmit at 
rate Rx; or 

ii) feed back "0" if R^' < Rx, indicating to give up the 
transmission and re-contend. 

Furthermore, the throughput under Strategy B is 7. 

Proof: The proof follows the same line of that for 
Theorem Q] 

V. Conclusion 

We have considered distributed opportunistic scheduling 
for single-hop ad-hoc networks in which many links contend 
for the same channel using random access. Specifically, we 
have investigated DOS with two-level channel probing by 
optimizing the tradeoff between the throughput gain from 
more accurate rate estimation and the corresponding probing 
overhead. Capitalizing on optimal stopping theory with two- 
level incomplete information, we have showed that the optimal 
scheduling policy is threshold-based and is characterized by 
either one or two thresholds, depending on system settings. 
We have also identified optimality conditions. In particular, our 
analysis reveals that DOS with second-level channel probing is 
optimal when the first-level estimated rate falls in between the 
two thresholds. By a numerical example, we have illustrated 
the effectiveness of the proposed DOS with two-level channel 
probing. Finally, we considered the extension of DOS with 
two-level probing to the case where there is a limited feedback, 
of the form (0, 1, e), from the receiver to its transmitter. 

So far, we have considered DOS with two-level probing, 
where we assumed that the refinement of the rate estimate 
is carried out once, via second-level probing of duration r. 
However, we can further extend this to L— level probing, where 
for k = 1, . . . , L — 1 has the options 1) to transmit, or 2) to 
defer and re-contend, or 3) to resort to (fc+1)— st level training 
at the cost of additional overhead. It is of great importance to 
devise well-structured, yet simple policies. 

We note that the proposed distributed scheduling with 
two-level probing provides a new framework to study joint 
PHY/MAC optimization in practical networks where noisy 
probing is often the case and imperfect information is in- 
evitable. We believe that this study provides some initial steps 
towards opening a new avenue on exploring channel-aware 
distributed scheduling for ad-hoc networks to enhance spec- 
trum utilization; and this is potentially useful for enhancing 
MAC protocols for wireless local area networks (LANs) and 
wireless mesh networks. 

Appendix A 
Derivation of Rate Equation (ITTb 

Let /3W = E \hy^\ 2 , we follow the approach proposed 

in [13] and normalize |/iW| 2 and IM 1 ^ 2 as 

\hW\ 2 



Vy*(x,Rx) > 0, Vx > x v , 



(32) 



which, in turn, gives a threshold below which the option of 
"defer and re-contend" is optimal. From the properties of 
<7 7 * (•, it is also clear that for all x > Rx, it is opti- 
mal to transmit immediately without a second-level probing. 
Therefore, the interval [a^lii] defines the gray area where 
one could benefit from performing a second-level probing. 

We note that the throughput, denoted by 7*, is the parameter 
to be optimized over the thresholds x v and Rx. Combin- 
ing d29b and d32l . we establish (f30b and d3~TT l that relate 
three key parameters, namely the lower threshold x v , the 
transmission rate Rx, and the throughput 7*. It can be seen 
from d30j and (fJB that x v = fx(Rx,J*) and 7* = f2(x v ,Rx), 
indicating that 7* = g(Ri). Then, R\ can be chosen to the 
one maximizing g{R\), i.e., 

R\ = argmax 5 (i?i); and -)* max = g{R\). 
Ri 

Accordingly, the optimal x* is given by 

x v = /l(-Rl) Imax)- 

Let {x*, R^, 7^ OI } be the set of parameters obtained as 
outlined above. Also, let R\ be the solution to (127) . The 
optimal strategy in the case with limited feedback is given 
by the following result. 

Theorem 3: The optimal strategy for DOS with two-level 
probing, with (0, 1, e) feedback, takes one of the two forms: 
[Strategy A] It is optimal for the receiver of the successful 
link to 

i) feed back "1 " if R^P > R\, indicating to transmit at 
rate R\ immediately after the first-level probing; or 

ii) feed back "0" if R^ < x*, indicating to give up the 
transmission and let all links re-contend; or 

iii) feed back "e" if Rn € [x*,R*), indicating to engage 
in second-level probing; and upon computing the new 
rate R^ , 

(2) 

a) feed back "1 " if R n > Rl, indicating to transmit 
at rate R\; or 

b) feed back "0" if Rn < R\, indicating to give up 
the transmission and re-contend. 

Furthermore, the throughput under Strategy A is Jmax- 
[Strategy B] There is never a need to perform second-level 
probing. That is, it is optimal for receiver of the successful 
link to 



A« = 
C (D = 



1 - /?« 
|feW| 2 



(33) 



(34) 



where both and £W are exponential-distributed with unit 



10 



7 = 



(1 - t)Ri J"* 1 (1 - G{R!\u))dF{u) + Rie 



(!- T ) (C(l-G(iii|u))«iF(tt) + e ) +r(i + . 

(l-r)( J Ri-7*)(l-G(^i|x))= 7 *r. 



(30) 
(31) 



Defining the "effective channel SNR" and "normalized error 
variance" as 



P% = (1-0 (1) )P. 
1 - /3« ' 

respectively. Substituting d35l l and (f36t in (TToT > results in 



A« 

A (l) = Peff A 



(35) 
(36) 

(37) 



It has been shown in [13] that the conditional probability 
distribution function (pdf) of given A' 1 ' takes the follow- 
ing form 



/( A (1) A (1) 



A« _l 



A (1) 

a m [aw] 2 exp l <> lji I A" 



(38) 



where I(-) is the indicator function. 

The following linear backoff function is employed to pre- 
vent channel outage. 



A C (A«) = a M Pef f X {1 \ 



(39) 



where <jm is the backoff factor with < <tm < !• 

Define the optimal stopping rule over the er-field T' gener- 
ated by {{p\hj\ 2 ,Kj),j = 1,2, . . . ,n}. Using I® and $59$, 
the conditional expectation Rn ^ can be computed as 



R 



(i) 



i5[log(l + A c (A( 1 )))i( Ac (A( 1 ))<A( 1 ))|^] 



1 — exp 



-1 



Wlog(l+0- M p|ft (1) | 2 



For the low SNR wideband regime where p — > oo and W 
©(-), i?! 1 '' can be well approximated by 



Appendix B 
Proof of LemmaQ] 

For a given 6, let iV(0) be a stopping rule such that 

N(6) = arg sup E {R N T d , N - 0T N ] . 

N€Q 



(40) 



Let Z„ — R n Td, n ~ 0T n . Then, it follows from Theorem 1 
in [5, Chapter 3] that N(9) exists if the following conditions 
are satisfied: 

(Al) £?[supZ„] < oo, and (A2) lim sup Z n = — oo, a.s., 

n n — >oo 

Since, it is clear that lim sup Z„ = — oo, we can easily verify 

n — >oo 

(A2). 

For some < fi < l/p s , we introduce 

Z' n = max{i?W,4 2 )}T - n{6- - p) 



and 



3=1 



Then, we note that 

E 



supZ n 


< E 




+ E 


supz;; 


n 




n 




n 



Appealing to Theorem 1 and Theorem 2 of [5, Chapter 4], 
we conclude that E [Z' n ] < oo and E \Z'£\ < oo, respectively. 
Therefore (Al) holds. 

The second part of the lemma follows directly from Theo- 
rem 1 in [5, Ch.6]. 

Appendix C 
Proof of Lemma|2] 

a) Using Fubini's theorem, we rewrite Jg(x,r) as 

poo 

J 6 '(x,0) = (1 -r) / (1 - G{u\x))du - 9*t. (41) 

Since G(y\x) decreases monotonically with x, Je*(x, 0) 
is also monotonically increasing in x. Note that lim (1 — 

x — >oo 

G(u\x)) = 1. Then, by Lebesgue's convergence theorem, 
we have linx^oo Jg* (x, 0) = oo. Next, recall that \z\ 2 ~ 
exp(i? e ). It follows that lim G(y\x) = G\ z <i(y) = 1 — e~«?, 

X — >0 

and consequently, 

lim J e . (x, 0) = (1 - T)i? e e~-& - 6*t. (42) 

_ Q* 

Thus, under the condition R e /9*e R e < r/(l — r), 

lim Je*(x,Q) < 0. (43) 

x^0 

b) Using Fubini's theorem, we can rewrite Jg*(x,r) as 
Jg*{x,0) = (1-r) fcra; + -R e - 6»* + J^G(u\x)du j -6»*r (44) 
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It follows that 

q(x) = (cr(l - t) - 1> + (1 - r)i? e + (1 - r) / G(u|a;)du. 

Jo 

We can verify that 

lim q(x) = RJl - t) + (1 - t)6* > 

a;— »0 

Furthermore, when c r < it is clear that 

lim q(x) = — (X). 

a: — >oo 

Since G(y|x) is monotonically decreasing in x, we conclude 
that q(x) is also monotonically decreasing in x. 

Appendix D 
Proof of Theorem[T] 

Let xj and x q be solutions to Jg*(x, 0) = and q(x) = 
respectively. From Lemma [2] we have 



and 



{< if x < xj 
= if x = xj 
> if x > xj 



< if x > i, 



g(a:) < = if ; 



(45) 



(46) 



> if X < Xq 



Thus, one of the following two possibilities holds. 
1) The case with x q > xj: 

From the above discussions and the monotonicity prop- 
erties of Jg.(-,0) and q(-), it follows that 

| x — 9* if x > x q 
max [x — 8* , Je* (a;, 0)] + = < Je* (x, 0) if x £ [xj, x q ] 







if x < xj 



(47) 

Furthermore, from (|47| | and the optimality equa- 
tion (l23l>. we have that 



f 

•J X ; 



Je*(u,0)dF(u)+ (u-6*)dF(u) = . (48) 

Ps 



Consequently, it is clear that the optimal strategy is 



1 (transmit) if JJ„ ' > x q 
MRn } )={ 2(2-level) if G [xj, x q ] (49) 

(re-contend) if Rn < xj 



and when (j)n(Rn^) = 2, the strategy is 



1 (transmit) if rS ] > 8* A 



(re-contend) if i?l 2) < 6* A 



(50) 



where 0* A is the solution to d48b . It can be seen 
that thresholds xj and x 9 are found as the solutions 
to Je» (x, 0) = and q(x) = respectively. Thus, 
{xj, Xq, 6* A } is the solution to the system d2~5l >. An 
illustration of Strategy A is depicted in Fig. [4] 

2) The case with x q < xj: 
From (1451) and (146b. we have 



max [x — 6", Je* (x, 0)] + = | q 



x — 8* if x > 9* ... 

ifx<0* P1J 



and Je*(x,0) < max [x — 6* , 0] . Therefore, it is never 
optimal to perform second-level probing. From (l5Tb and 
the optimality equation d23b we obtain 



(x-e*)dF(x) = , 



which is equivalent to ( fT2l . Thus from ((5TJ, the optimal 
strategy is 



4(R 



1 (transmit) 

(re-contend) if R^ 1 < 8* B 



if R^ > 

(i) 



(52) 



where the threshold 6* B is the solution to ( fT2b . An 
illustration of Strategy B is depicted in Fig. [5] 



Appendix E 
Proof of Theorem[2] 

Suppose Je*(8 A ,0) > 0. Then, this implies that 
Je* (S A ,0) > max[i — 8 A ,0] when x = 0* A . Specifically, 
when r[^ — 8 A , performing second-level probing and using 
an optimal strategy thereafter yield an expected reward of 
Jq* a (8 a ,0), which is at least as good as using Strategy B. 
Equivalently, we show that there exists at least one value of x 
(9* A in this case) for which performing second-level probing 
is optimal. We conclude that Strategy A is optimal. 

Next, we assume Strategy A is optimal and show that 
Je*{0* A ,Q) > 0. Under such an assumption, there must exist 
some x\ for which it is beneficial to demand additional 
information, i.e. 

Jg^(a;i,0) > max[ii —8 A ,0]. (53) 

We now investigate Jg* A (6* A , 0) in two different cases, namely 
9* A > xi and 6* A < x%. 

1) The case with 6* A > xi: 
In this case, 

Je* A (0*a, 0) > Je A {xi, 0) > max^i -8 A ,0]=0, (54) 

where the first and second inequalities are due to the 
monotonicity of J(-,0) and the assumed optimality of 
Strategy A, respectively. 

2) The case with 0* A <X\\ 
In this case, 



Je* (0* A , 0) > J fl . (x 1} 0) -X! + e%>0, 



(55) 



where the first inequality follows from the fact that 
Jg A (x,0) — x + 9 is decreasing in x and the second 
inequality is due to 



Summarizing the above two cases, we conclude that 
Je A (^A' 0) > is a necessary condition for the optimality of 
Strategy A. Using contra position, we conclude that Strategy 
B is optimal if J e * (0 A ,0) < 0. 
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Appendix F 
Proof of Lemma|5] 

It is clear that Vy (x, Ri) is monotonically increasing in x, 
and that 

lim Vy*(x,Ri) = {1-t)Ri -7*. 

x — >oo 

Since 7* < ju, it follows that lim V 1 *(x 1 Ri) > 0, provided 

x — >oo 

that t < 1 — p 8 . Furthermore, observe that 

UmV~.(x,Ri) = {1-t){R 1 - 7 *) e -^ - 7 * T 

X— *0 

< 7* f(l- r )(— - l)e~5t _ T 

/ 1 "1 

= T7* (1 H )e B l H<2) ] e «■= - 1 

V Ps 



< 



t 7 * ((1 + j)e- {1+2r) ~ l) , 



where the last inequality follows due to the fact that 
1 and Ml^l w (1 + 2r). We conclude that 



g[fl (2) ] < 



[12] W. Stadje, "An optimal stopping problem with two levels of incomplete 

information," Mathematical Methods of Operations Research, vol. 45, 

no. 1, pp. 119-131, 1997. 
[13] A. Vakili, M. Sharif, and B. Hassibi, "The effect of channel estima- 
tion eiTor on the throughput of broadcast channels," in Proc. IEEE 

ICASSP'06, Toulouse, France, May 2006. 
[14] P. Viswanath, D. N. C. Tse, and R. Laroia, "Opportunistic beamforming 

using dumb antennas," IEEE Trans. Info. Theory, vol. 48, no. 6, pp. 

1277-1294, Jun. 2002. 
[15] B. Wang, J. Zhang, and L. Zheng, "Achievable rates and scaling laws 

of power-constrained wireless sensory relay networks," IEEE Trans. Inf. 

Theory; vol. 52, pp. 4084^104, 2006. 
[16] Z. Ye, A. Abouzeid, and J. Ai, "Optimal stochastic policies for 

distributed data aggregation in wireless sensor networks," IEEE/ACM 

Trans, on Networking, to appear, 2008. 
[17] D. Zheng, W. Ge, and J. Zhang, "Distributed opportunistic scheduling 

for ad-hoc communications: An optimal stopping approach," in Proc. 

Mobihoc 2007, Montreal, Canada, Sept. 2007. 
[18] D. Zheng, M. O. Pun, W. Ge, J. Zhang, and H. V. Poor, "Distributed 

opportunistic scheduling for ad hoc communications with imperfect 

channel information," IEEE Transactions on Wireless Communications, 

vol. 7, pp. 5450-5460, December 2008. 



lim V r *(x,R!) < 0, for t > 0.5 fin f 1 + — ] - 1 

x^a \ \ Ps J 

The second part follows from the facts that for x > Ri, 

q-f* (x, Ri) 

= (l-r)(l-G(iii|*))(7*-iii)-T.Ri<0 ! 
and for x < Ri, 

<? 7 *(z,i?i) 

= (1 - t)Ri (1 - G (Ri\x)) + (1 - t)G {Ri\x) 7* > 0. 
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